Clustering Algorithm for 2D Multi-Density Large Dataset Using Adaptive Grids
نویسندگان
چکیده
Clustering is a key data mining problem. Densitybased clustering algorithms have recently gained popularity in the data mining field. Density and grid based technique is a popular way to mine clusters in a large spatial datasets wherein clusters are regarded as dense regions than their surroundings. The attribute values and ranges of these attributes characterize the clusters In this paper we adapt a density-based clustering algorithm, Grid Density clustering using triangle subdivision (GDCT) capable of identifying arbitrary shaped embedded clusters as well as multi density clusters over massive spatial datasets. Experimental results on a wide variety of synthetic and real data sets demonstrate the effectiveness of Adaptive grids and triangle subdivision method. Keywords— Clustering, Density Based, Intrinsic Cluster, Adaptive Grids
منابع مشابه
An Adaptive LEACH-based Clustering Algorithm for Wireless Sensor Networks
LEACH is the most popular clastering algorithm in Wireless Sensor Networks (WSNs). However, it has two main drawbacks, including random selection of cluster heads, and direct communication of cluster heads with the sink. This paper aims to introduce a new centralized cluster-based routing protocol named LEACH-AEC (LEACH with Adaptive Energy Consumption), which guarantees to generate balanced cl...
متن کاملBreast Cancer Risk Assessment Using adaptive neuro-fuzzy inference system (ANFIS) and Subtractive Clustering Algorithm
Introduction: The adaptive neuro-fuzzy inference system (ANFIS) is a soft computing model based on neural network precision and fuzzy decision-making advantages, which can highly facilitate diagnostic modeling. In this study we used this model in breast cancer detection. Methodology: A set of 1,508 records on cancerous and non-cancerous participant’s risk factors was used. First,...
متن کاملA Distributed Algorithm for Intrinsic Cluster Detection over Large Spatial Data
Clustering algorithms help to understand the hidden information present in datasets. A dataset may contain intrinsic and nested clusters, the detection of which is of utmost importance. This paper presents a Distributed Grid-based Density Clustering algorithm capable of identifying arbitrary shaped embedded clusters as well as multi-density clusters over large spatial datasets. For handling mas...
متن کاملAdaptive Grids for Clustering Massive Data Sets
Clustering is a key data mining problem. Density and grid based technique is a popular way to mine clusters in a large multi-dimensional space wherein clusters are regarded as dense regions than their surroundings. The attribute values and ranges of these attributes characterize the clusters. Fine grid sizes lead to a huge amount of computation while coarse grid sizes result in loss in quality ...
متن کاملTask Scheduling Using Particle Swarm Optimization Algorithm with a Selection Guide and a Measure of Uniformity for Computational Grids
In this paper, we proposed an algorithm for solving the problem of task scheduling using particle swarm optimization algorithm, with changes in the Selection and removing the guide and also using the technique to get away from the bad, to move away from local extreme and diversity. Scheduling algorithms play an important role in grid computing, parallel tasks Scheduling and sending them to ...
متن کامل